Basic Statistics

Raw Counts

Name Value
Rows 2,644
Columns 400
Discrete columns 9
Continuous columns 391
All missing columns 0
Missing observations 412
Complete Rows 2,232
Total observations 1,057,600
Memory allocation 6.7 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 4 columns ignored with more than 50 categories.
## player: 2520 categories
## nationality: 102 categories
## squad: 98 categories
## Attendance: 98 categories

QQ Plot

## Warning: Removed 153 rows containing non-finite values (stat_qq).
## Warning: Removed 153 rows containing non-finite values
## (stat_qq_line).

Correlation Analysis

## 4 features with more than 20 categories ignored!
## player: 2143 categories
## nationality: 98 categories
## squad: 83 categories
## Attendance: 83 categories
## Warning in cor(x = structure(list(ID = c(21L, 390L, 430L, 737L,
## 770L, 826L, : la deviazione standard è zero

Principal Component Analysis

## 4 features with more than 50 categories ignored!
## player: 2143 categories
## nationality: 98 categories
## squad: 83 categories
## Attendance: 83 categories
## Warning in plot_prcomp(data = structure(list(ID = c(21L, 390L, 430L, 737L, : The following features are dropped due to zero variance:
##  * Season_201920.